Noise-robust ASR by Using Disti Approximated with Logarithmic No

نویسنده

  • Takashi Fukuda
چکیده

Various approaches focused on noise-robustness have been investigated with the aim of using an automatic speech recognition (ASR) system in practical environments. We have previously proposed a distinctive phonetic feature (DPF) parameter set for a noise-robust ASR system, which reduced the effect of high-level additive noise[1]. This paper describes an attempt to replace normal distributions (NDs) of DPFs with logarithmic normal distributions (LNDs) in HMMs because DPFs show skew symmetry, or positive and negative skewness. The HMM with the LNDs was firstly evaluated in comparison with a standard HMM with NDs in an experiment using an isolated spoken-word recognition task with clean speech. Then noise robustness was tested with four types of additive noise. In the case of DPFs as an input feature vector set, the proposed HMM with the LNDs can outperform the standard HMM with the NDs in the isolated spoken-word recognition task both with clean speech and with speech contaminated by additive noise. Furthermore, we achieved significant improvements over a baseline system with MFCC and dynamic feature-set when combining the DPFs with static MFCCs and ∆P.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Normalized Autocorrelation based Features for Robust Speech Recognition in Context with Noisy Environment

This paper presents a robust approach for an automatic speech recognition system (ASR) when both additive and convolutional noises corrupt the speech signal. Robust features are derived by assuming that the corrupting noise is stationary and the channel effect is fixed during the utterance. In the proposed method the effect of additive and convolutional distortions are minimized by two stage fi...

متن کامل

Is speech enhancement pre-processing still relevant when using deep neural networks for acoustic modeling?

Using deep neural networks (DNNs) for automatic speech recognition (ASR) has recently attracted much attention due to the large performance improvement they provide for a variety of tasks. DNNs are known to be robust to overfitting and to be able to remove speaker variability. Another important cause of variability in speech is the presence of noise. A lot of research has been undertaken on noi...

متن کامل

A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech

We propose a novel framework for noise robust automatic speech recognition (ASR) based on cochlear implant-like spectrally reduced speech (SRS). Two experimental protocols (EPs) are proposed in order to clarify the advantage of using SRS for noise robust ASR. These two EPs assess the SRS in both the training and testing environments. Speech enhancement was used in one of two EPs to improve the ...

متن کامل

Optimizing the Implementation of MMSE Enhancement for Robust Speech Recognition

In this paper several methods are proposed to optimize the implementation of minimum mean-square error (MMSE) estimation algorithm for robust automatic speech recognition (ASR). In the calculation of MMSE enhancement algorithm, the original confluent hyper-geometric function is approximated by a piece-wise linear function, which greatly reduces the computation load while keep the same performan...

متن کامل

From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR

The multi-band processing paradigm for noise robust ASR was originally motivated by the observation that human recognition appears to be based on independent processing of separate frequency sub-bands, and also by “missing data” results which have shown that ASR can be made significantly more robust to band-limited noise if noisy sub-bands can be detected and then ignored. Of the different mult...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003